One-Unambiguous Regular Languages
نویسندگان
چکیده
The ISO standard for the Standard Generalized Markup Language (SGML) provides a syntactic meta-language for the definition of textual markup systems. In the standard, the right-hand sides of productions are based on regular expressions, although only regular expressions that denote words unambiguously, in the sense of the ISO standard, are allowed. In general, a word that is denoted by a regular expression is witnessed by a sequence of occurrences of symbols in the regular expression that match the word. In an unambiguous regular expression as defined by Book et al. (1971, IEEE Trans. Comput. C-20(2), 149 153), each word has at most one witness. But the SGML standard also requires that a witness be computed incrementally from the word with a onesymbol lookahead; we call such regular expressions 1-unambiguous. A regular language is a 1-unambiguous language if it is denoted by some 1-unambiguous regular expression. We give a Kleene theorem for 1-unambiguous languages and characterize 1-unambiguous regular languages in terms of structural properties of the minimal deterministic automata that recognize them. As a result we are able to prove the decidability of whether a given regular expression denotes a 1-unambiguous language; if it does, then we can construct an equivalent 1-unambiguous regular expression in worst-case optimal time. ] 1998
منابع مشابه
Unambiguous languages exhaust the index hierarchy
This work is a study of the expressive power of unambiguity in the case of infinite trees. An automaton is called unambiguous if it has at most one accepting run on every input, the language of such an automaton is called an unambiguous language. It is known that not every regular language of infinite trees is unambiguous. Except that, very little is known about which regular tree languages are...
متن کاملOn the Hierarchy of Block Deterministic Languages
A regular language is k-lookahead deterministic (resp. k-block deterministic) if it is specified by a k-lookahead deterministic (resp. k-block deterministic) regular expression. These two subclasses of regular languages have been respectively introduced by Han and Wood (k-lookahead determinism) and by Giammarresi et al. (k-block determinism) as a possible extension of one-unambiguous languages ...
متن کاملUnambiguity and uniformization problems on infinite trees
A nondeterministic automaton is called unambiguous if it has at most one accepting run on every input. A regular language is called unambiguous if there exists an unambiguous automaton recognizing this language. Currently, the class of unambiguous languages of infinite trees is not well-understood. In particular, there is no known decision procedure verifying if a given regular tree language is...
متن کاملJa n 20 14 Unambiguous Büchi is weak
A non-deterministic automaton running on infinite trees is unambiguous if it has at most one accepting run on every tree. The class of languages recognisable by unambiguous tree automata is still not well-understood. In particular, decidability of the problem whether a given language is recognisable by some unambiguous automaton is open. Moreover, there are no known upper bounds on the descript...
متن کاملBoolean algebras of unambiguous context-free languages
Several recent works have studied subfamilies of deterministic context-free languages with good closure properties, for instance the families of input-driven or visibly pushdown languages, ormore generally families of languages accepted by pushdown automatawhose stack height can be uniquely determined by the input word read so far. These ideas can be described as a notion of synchronization. In...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Inf. Comput.
دوره 142 شماره
صفحات -
تاریخ انتشار 1998